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Abstract 



f^ • We study two types of stochastic processes, a mean-field spatial system of interacting Fisher- 

Wright diffusions with an inferior and an advantageous type with rare mutation (inferior to ad- 
vantageous) and a (mean-field) spatial system of supercritical branching random walks with an 

i-Q ' additional deathrate which is quadratic in the local number of particles. The former describes a 

standard two-type population under selection, mutation, the latter models describe a population 
under scarce resources causing additional death at high local population intensity. Geographic 
space is modelled by {1, • • • ,N}. The first process starts in an initial state with only the inferior 
type present or an exchangeable configuration and the second one with a single initial particle. 
This material is a special case of the theory developed in [DGscl . 
^ ' We study the behaviour in two time windows, first between time and T and secondly after a 

large time when in the Fisher- Wright model the rare mutants succeed respectively in the branching 
random walk the particle population reaches a positive spatial intensity. It is shown that the 
second phase for both models sets in after time a -1 log N, if N is the size of geographic space 
and iV -1 the rare mutation rate and a € (0, oo) depends on the other parameters. We identify 

C ' the limit dynamics in both time windows and for both models as a nonlinear Markov dynamic 

(McKean-Vlasov dynamic) respectively a corresponding random entrance law from time — oo of 
this dynamic. 

Finally we explain that the two processes are just two sides of the very same coin, a fact 
arising from duality, in particular the particle model generates the genealogy of the Fisher- Wright 
diffusions with selection and mutation. We discuss the extension of this duality relation to a 
multitypc model with more than two types. 
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INTRODUCTION 3 

Introduction 

0.1 Motivation and background 

We study here features of the longtime behavior of two models for the stochastic evolution of pop- 
ulations, the classical (mean-field) spatial version of a system of interacting Fisher- Wright diffusions 
with selection and mutation on the one hand and a logistic spatial branching particle model both on 
geographic space {1, • • • , N}. We shall later explain the mathematical relation between them. 

Fisher- Wright model. This process comes from population genetics and models a population of 
individuals of two types evolving under migration, resampling, selection and mutation. It is the 
many individual limit of a discrete model. Migration here means individuals move in geographic 
space, resampling that pairs are replaced by an offspring pair each choosing a parent at random and 
adopting the parents type, mutation is a spontaneous change of type of an individual and under 
selection the choice of the parent in the resampling event is biased according to the parents' fitness. 

Here we are particularly concerned with a situation where we have an inferior type and an advan- 
tageous type. The case we are interested in is that the mutation rate from inferior to advantageous 
is very small so that in finite time we expect 0(l)-many mutations in all of space as space gets large. 
We want to follow the population through the emergence and fixation of the whole population in the 
advantageous type. 

Logistic branching model. Here we consider a population of particles which migrate in space, have 
offspring at a certain rate s, die with a certain rate (all particles act independently) but here the risk 
of death is at a rate increasing with the population size of the site and being zero for one particle. 
The latter mechanism induces an interaction of the families, which therefore do not evolve anymore 
independently of each other. 

Here for this model we want to see how the population starting from one particle spreads and 
eventually colonizes the whole space as time evolves, meaning that a positive spatial intensity is 
reached and a local equilibrium situation arises where locally the process neither becomes extinct 
nor grows and becomes infinitely large as t — > oo. This is in contrast to the behaviour of classical 
branching models with their survival versus extinction dichotomy in finite geographic space and reflects 
the limited resources in a given colony. 

McKean- Vlasov equation and random entrance laws. The techniques we use to study the questions 
raised above for two models is the mean-field limit, where we choose migration to occur according to 
the uniform distribution on {1, • • • , N} and rare mutation having rate iV _1 and where we let TV — > oo. 
As limit dynamics a nonlinear Markov process of the type first introduced by McKean |Mc| arises, i.e. 
an evolution with a generator where the parameters depend on the current state of the process and on 
the current law. The transition probabilities solve the McKean-Vlasov equation which has a similar 
structure to the equations introduced by Vlasov to describe the dynamics of a plasma consisting of 
charged particles with long-range interaction. A similar scenario holds for the branching particle 
system. 

The first rigorous and systematic analysis of mean-field models and resulting McKean-Vlasov dy- 
namics is Gartner's fundamental paper in 1988 [Gar] which established the existence and uniqueness 
of weak solutions for a general class of McKean-Vlasov equations and the associated non-linear mar- 
tingale problems but under the condition that the diffusion matrix is strictly positive definite. Hence 
it does not cover the case dealt with in this paper due to the fact that for the Fisher Wright diffusions 
the diffusion function vanishes at the boundary. 

We must also extend here the methodology of the McKean-Vlasov equation further in order to 
describe the limiting behaviour in different time windows. In order to also consider a late time 
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window we need first of all to introduce the notion of an entrance law from time — oo (our time 
parameter has the form TV +t, tel and TV — > oo as N — » oo) but since in the initial time phase 
some randomness is involved (rare mutation respectively very small early particle intensity) we even 
have to work with random entrance laws to the McKcan-Vlasov equation. 

Duality. The mathematical structure which relates our two models is the fact that they are 
in duality, i.e. expectations of certain functionals under one dynamic are given by expectations 
of appropriate functionals under the other dynamic with the time direction reversed. This will be 
explained in detail later on. 

The duality relation which we present here is a special case of a broader new duality theory for 
multitypc models, which allows a historical and genealogical interpretation and which has been devel- 
oped in |DGselj covering much more general situations than we can discuss here. More information 
on the particle system can be found in |SchirmlO] . 

Remark 1 In the framework just sketched we can analyse the features of the population as described 
above. If one wants to adapt a more realistic model for geographic space the mean-field limit has 
to be replaced by the hierarchical mean-field limit for which the present analysis is a key ingredient. 
Then it is possible to study asymptotically two-dimensional geographic space via its approximations 
by the hierarchical group of order N and N — > oo. This also allows us to investigate the question of 
universality of the behaviour. All this is carried out in WGseltf and we refer the interested reader to 
this paper. 

0.2 Outline 

In section Q]wc shall present the Fisher- Wright model, in section [5] the logistic branching model and 
in section [3] the connection between both via duality. 

1 The Fisher- Wright model with rare mutation and selection: 
Behaviour in two time windows 

Here we introduce the Fisher- Wright model, some relevant time windows, state its properties in two 
time windows and consider the McKean-Vlasov limit for the model in five separate subsections. 

1.1 A two-type mean-field diffusion model and its description 

We study a population with two types, one of low and the other of high fitness, where initially all the 
population is of the lower type but by a rare mutation the advantageous type appears and spreads 
with time. The population is described specifying the proportion of the type 1 (the inferior) in every 
spatial colony. 

Formally we look at a process (X N (t))t>o with N G N of the following form: 

(0.1) X N (t) = ((x N (i,t),x?(i,t)); i = l,..., N), 

(0.2) xf(i,0) = l, i = l,---,N, 
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satisfying the well-known SSDE: 

(0.3) dx?(i,t) = c(x?{t)-x?(i,t))dt - sx?(i,t)x%(i,t)dt-yx?(i,t)dt 



+ \J d ■ x^(i,t)x^(i,t)dwi(i,t), 



(0.4) dx%{i,t) = c(x$(t)-x%(i,t))dt + sxf (i,t)x%{i,t)dt + —x?(i,t)dt 



yd ■ x^(i, t)x£(i, t)dw2(i, t), 



where W2(i,t) = —Wi(i,t) and {(wi(i, t))t>o,i = l,...,iV} are i.i.d. Brownian motions, m,d,s,L G 
(0, oo ) and 



1 N 

(0-5) xf{t) = -xf (t) with I = 1,2, xf (t) = J2*i (*".*)■ 

8=1 

Later we will use the parameter of mutation strength and size of geographic space satisfying 
(0.6) L = N. 

We can study this system in various ways, locally by looking at a if tagged sites 
(0.7) (xf(l,t),---,xf(K,t)), 1 = 1,2 

or globally using the concept of the empirical measure of the complete population: 

N 

N 



1 N 
(0.8) E N (t) :=kT, S (*?d,t),x?(i,t)) e V(V({1, 2})) 



and the empirical measure process of either type: 

N 

N 



1 N 
(0.9) S N (t, £) := - J2 6 *?d,t) 6 ^([0. 1]), < = 1, 2. 



8 = 1 



Note that since x^(i,t) + x% (i,t) = 1 it suffices in the case of two types to know one component of 
the pair in (|0.9p . the other is then determined by this condition. Then for two types we can effectively 
replace V(V({1, 2})) by ^([0, 1]) as states of the empirical measure. 

However since very soon rare mutants appear somewhere in space they are present for t > 0, but 
of course the set of sites where they are visible is sparse, we have to find a way to describe this small 
subset where the advantageous type appears. One (global) way is to consider the process 

N 



(0.10) xf(t) =^xf{i,t) with£= 1,2, 



but this way we lose the internal structure of the droplet of sites with advantageous types of substantial 
mass. In order to keep track of the sparse set of sites at which nontrivial mass appears we will give a 
random label to each site and define the following atomic-measure-valued process. 
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We assign independent of the process a point a(j) randomly in [0, 1] to each site j € {1, . . . , N}, 
that is, we define the collection 

(0.11) {a(j), j = l,---,N} i.i.d. uniform on [0, 1]. 

We then associate with our process and a realization of the random labels a measure-valued process 
on V([0, 1]), which we denote by (2 t ' m )t>o where 

N 

(o.i2) :f m = J2^U,t)s aU) . 

The process (2 t ' m )t>o describes the essential features of the advantageous population. 

1.2 Two time windows for the spread of the advantageous type 

The first question now is how the advantageous type develops in finite time and the second on what 
time scale does the advantageous type take over the whole population meaning we want to identify 
a time T^x after which the advantageous type has positive spatial intensity. A key role is played by 
space and the fact that the dynamic is random and not deterministic as in the scenario looked at by 
jBuj . So we look at the cases N = 1, N large and L small, d = 0, d > 0. 

Case N = 1 (nonspatial) 

Here we find the time scales in which we emerge (Tat,l = Tl) as L — >• oo to be 

(0.13) 0(s _1 log X) for d = 0, 

(0.14) 0(L) for d > 0. 

This qualitative difference is essentially due to the fact that in the stochastic model the diffusion 
can hit by sheer random effects, while in the deterministic model the advantageous type expands 
exponentially fast leading to a logL time scale. 

Case N 3> 1 (large spatial model) 

Here we take L = N(Tn_l = TV) and then we find also in the deterministic case the time scale 
s _1 log N, but also now in the stochastic model, d > 0, we have a time period a -1 logiV but now the 
a will turn out to be strictly smaller than s. In the sequel we analyse the latter case in more detail 
and identify the constant a. 

Remark 2 The relation L = N^ 1 is appropriate if one considers the mean-field model which in the 
hierarchical mean-field limit is a potential-theoretic analogue of two-dimensional space as N — > oo, see 
WGseflj and for which migration, mutation, selection are all three together important. 

The task is now to describe the system in the limit TV — > oo in two time windows: 



• 



times of order 1 after starting: (X (t)) 



• times a l log N after starting: (X N (a l logiV + £) + )t € R. 

In the first window the evolution of the small advantageous droplet is of primary interest, in other 
words the process 1 N ' m , while in the second case when the droplet covers a positive proportion of the 
whole space, the system is best described by the empirical measure "E.N (see l0.8j) or the tagged sample 
see (jOTT|) . 



1 THE FISHER-WRIGHT MODEL WITH RARE MUTATION AND SELECTION 



1.3 The early time window as iV — > oo 

Here we want to describe (1) the evolution of the atomic measure- valued process 2 N ' m in the limit 
N — > oo over times in some finite interval and (2) the limit N — > oo of the dynamic of the empirical 
measure. 

(1) Droplet evolution 

During the early times there will be a finite random number of sites where the advantageous type 
has mass exceeding some prescribed e > 0. Therefore as N — > oo we expect a limiting evolution of 
(J,^)t>o- First of all we can show 

Proposition 1.1 (Limiting droplet dynamic) 

As N ->■ oo 

(0.15) £[Qf' m ) t > ] => C[(lT)t>o], 

N—>oo 

in the sense of convergence of continuous M a ([0, 1]) -valued processes where the set of atomic measures 
M a ([0, 1]) is equipped with the weak atomic topology. The topology is introduced in \EKJ$ and we don't 
touch this further here. □ 



To identify the limit evolution 1 m we need a bit of classical excursion theory (compare [PYj , section 
3, pi]). 

Lemma 1.2 (Single site: entrance and excursion laws) 

(a) Let c > 0, d > 0, s > 0. Then is an exit boundary for the the Fisher-Wright diffusion 



(0.16) dx{t) = -cx(t)dt + sx(t)(l - x(t))dt + y/d-x(t)(l - x(t))dw(t), 

which then has a a -finite entrance law from state at time 0, the a -finite excursion law 

(0.17) Q = Q c ' d ' s 

on 

(0.18) Wq := {w G C([0, oo), K + ), w(0) = 0, w(t) > for < t < ( for some C G (0, oo)}. 

(b) Moreover, denoting by P E the law of the process started with w(0) = e and e > 0, Q is given 
by: 

(0.19) Q(.) = lim-^, 

where S(-) is the scale function of the diffusion HU.16}) . defined by the relation, 

(0.20) p e (T„<oo) = ||^, 0<£< ?? <oo, 

where T v is the first hitting time of r/. 

For the Fisher- Wright diffusion S is given by (cf \RWf . V28)) the initial value problem: 

dS t>~2sx 

(0.21) 5(0) =0,^ 



dx (l-x) 2c ' 
so that 

(0.22) lim ^ = 1. 
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(c) The measure Q is a-finite, namely for any n > 0, C as in hO.18} ), 
(0.23) Q({w : CH > r,}) < oo, 

(0.24) Q(sup(w(i)) > n) = Q(T n < oo) - — - — »- oo as 77 -> 0, 
t 5(77) 



(0.25) 



/ xQ{sup(w{t)) £ dx) = 00. D 
Jo t 



Now the limit (N — > 00) droplet dynamic 1 m can be identified as follows. 

Proposition 1.3 (A continuous atomic-measure-valued Markov process) 

Let N(ds, da, du, dw) be a Poisson random measure on (recall \0.18\) for Wo) 

(0.26) [0, 00) x [0, 1] x [0, 00) x W a , 

with intensity measure 

(0.27) dsdaduQ{dw), 

where Q is the single site excursion law defined in It0.19\) in Lemma \l.£\ 
Then the following two properties hold. 

(a) The stochastic integral equation for Q™)t>o given as 

j-t f j-q(s,a) p 

(0.28) T t l = / / / / w{t-s)S a N(ds,da,du,dw), t>0, 

JO J[0,1] Jo Jw 

where g(s, a) denotes the non-negative predictable function 
(0.29) g(*,o):=(m + c^L([0,l])), 

has a unique continuous A4 a ([0, 1]) -valued solution, which equals (2Y l )t>o from equation \0.15\) . 

(b) The process (H™)t>o has the following properties: 

• branching property (the process is like a branching process with immigration) 

• the mass of each atom observed from the time of its creation follows an excursion from zero 
generated from the excursion law Q (see i0.19\) ). 

• new excursions are produced at time t at rate 
(0.30) ro + c3j*([0, 1]), 

• each new excursion produces an atom located at a point a £ [0, 1] chosen according to the uniform 
distribution on [0, 1], 

• at each t and e > there are at most finitely many atoms of size > e. □ 
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(2) McKean-Vlasov equation for limiting empirical measure. 

Turn now to the global description of the complete population by the empirical measure. In the 
limit N — > oo the evolution of the empirical measure in a finite time window is given by the McKcan- 
Vlasov limit of our system, but of course it is trivial, that is, totally concentrated on type 1 if the 
given initial state has this property. This is however different at late times. Consider therefore the 
above system (|0. 1[) - (|0.4[) of N interacting sites with type space K = {1, 2} starting at time t = from 
a product measure (that is, i.i.d. initial values at the N sites). The basic McKean-Vlasov limit (cf. 
[DG99J . Theorem 9) says that if we start initially in an i.i.d. distribution, then 

(0.31) {E N {t)} <t<T =^ {£t}o<t<T, 

— — JV — ► oo 

where the 'P(7-'({1, 2})-valued deterministic path {Ct}o<t<T is the law of a nonlinear Markov process 
solving a forward equation, namely the unique weak solution of the McKean- Vlasov equation: 

d^ 
dt 

where for 7r 6 V(V(K)), L" is given by the generator of the process given by the evolution of type 1 
in (|0.34p below, (and n = m(t)) and the * indicates the adjoint of an operator mapping from a dense 
subspace of Cb{E,M) into Cb(E,R) w.r.t. the pairing of V(E) and Cb{E,M) given by the integral of 
the function with respect to the measure. 

As pointed out above in (|0.9p . in the special case of the type set {1,2}, we can simplify by 
considering the frequency of type 2 only and by reformulating (|0.32[) living on V(V({1, 2})) in terms 
of C t {2) G V[0, 1]. This we carry out now. 

Namely we note that given the mean-curve 



(0.32) ^i = (Lf*)*£ t , 



(0.33) m(t)= / yC t (2)(dy), 

[0,1] 

the process (£t(2))t>o is the law of the solution of (i.e. the unique weak solution) the SDE: 



(0.34) dy(t) = c(m(t) - y{t))dt + sy(t)(l - y{t))dt + y/dy(t)(l - y(t)dw(t), 

with w being standard BM. Then informally (£t)t>o corresponds to the solution of the nonlinear 
diffusion equation. Namely for t > 0, £j(2)(-) is absolutely continuous and for 

(0.35) Ct(2)(dx)=u(t,x)dx e^([0,l]) 

the evolution equation of the density u(t, ■) is given by: 

d d f d d d 2 

(0-36) —u(t,x) = -c—{[ / yu(t 1 y)dy-x]u(t 1 x)}~s—(x(l~x)u(t,x)) + -— I (x{l~x)u(t,x)). 

[o.i] 
We have the following basic property for the McKean-Vlasov equation. 

Proposition 1.4 (McKean-Vlasov equation and its solution) 

(a) Given the initial state fio £ 7- > ([0, 1]) there exists a unique solution 

(0.37) C t (2)(dx) = nt(dx), t>t Q 

to liO.32)) with initial condition Ct {2) = /iq. 

(b) If s > and J, Q ., x^t {dx) > 0, then this solution satisfies: 

(0.38) lim C t (2)(dx)=6 1 {dx). □ 
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1.4 The late time window as TV — > oo 

In the late time window we see (global) emergence and then fixation of the advantageous type. 

(1) Emergence times. Here we begin by studying the emergence by identifying the time of emer- 
gence and of fixation of the advantageous type as follows. 

Proposition 1.5 (Macroscopic emergence and fixation times) 

(a) (Emergence-time) 

There exists a constant a with: 

(0.39) < a < s, 

such that if TV = — log N, then for ( e K and asymptotically as N — > oo type 2 is present at times 
T/v + 1, i.e. there exists a e > such that for every i, 

(0.40) liminf P[x%(i, T N + t) > e] > 0, 

N— >OG 

and type 2 is not present earlier, namely for 1 > e > 0: 
(0.41) lim \xm8up[P(xi(i,T N +t)<l-e] = 0. 

t-i—oo jv-s-oo 

(b) (Fixation time) 

After emergence the fixation occurs in times 0(1) as N — > oo, i.e. for any e > 0, i G N 

(0.42) lim limsupP[zf (i,T N + 1) > e] = 0. □ 

i-s-oo jv-s-oo 

Corollary 1.6 (Emergence and fixation times of spatial density) 

The relations {O.^Cfy , {0-41^ and {0-42^ hold for x^ respectively Xi as well. □ 

We can identify the parameter a as follows from the droplet growth behaviour. 

Proposition 1.7 (Long-time growth behavior of"!™) 

Assume that m > 0. Then the following growth behavior of1 m holds, 
(a) There exists a* such that the following limit exists 

(0.43) lim e- Q *'£p™([0, 1])] e (0, oo), 

t— >oo 

with (here a is from i().. r J9\ )) 

(0.44) a* — a, where a is given below in Ii0.93\) 

(0.45) e- at 7 t n ([0, 11) =*> W* in probability, < W* < oo a.s.. 

(b) The growth factor in the exponential is truly random: 
(0.46) < Var[ lim e - at 2T([0, 1])] < oo. □ 

t— >oo 

Remark 3 The random variable W* reflects the growth of1" l ([0, 1]) in the beginning, as is the case 
in a supercritical branching process and hence £* = a _1 logW* can be viewed as the random time 
shift of that exponential e at which matches the total mass ofl™ for large t. 
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Remark 4 Even though one might think that for small mass of the advantageous type this expands 
at the rate e st , this is not the case due to the stochastic effects leading to a subtle interplay between 
the parameters s,d and c resulting in a* = a < s. 

(2) Fixation dynamic. We now understand the preemergence situation and the time of emergence 
and fixation. In order to describe the whole dynamics of macroscopic fixation, we consider the limiting 
distributions of the empirical measure- valued processes in a second time window (a -1 log N+t) + , t £ 
M. Define (we suppress the truncation in the notation below) 

1 N 
(0.47) S^' a (t):=T7E <y (xf(i,^ + t),x-(i,l 2 ^ + t)). ' eK > (Z l ^ a (t)eV(V(K))) 

and then the two empirical marginals are given as 

1 N 
(0.48) S^M):=^E^f(i,^+tV «=l,2andteK, (^ a (t,£) G V([0, 1])). 

i=l 

Note that for each t and given £ the latter is a random measure on [0,1]. Furthermore we have the 
representation of the empirical mean of type 2 as follows: 

(0.49) a^(!2!__ +t ) = / x E l ^ a (t,2)(dx). 

a J [0,1] 

Since we consider the limits of systems observed in the interval const ■ log TV + [— ?, -=■] with T 
any positive number, that is setting to(N) = const ■ log N — -£, we need to identify entrance laws for 
the process from -co (by considering T — > oo) out of the state concentrated on type 1 with certain 
additional properties. 

The next main result is on the fixation process, saying that 5^ S,Q converges as N — > oo and that 
the limit can be explicitly identified as a random McKean-Vlasov entrance law from — oo, a concept 
we explain next. 

Definition 1.8 (Entrance law from t = — oo,) 

We say in the two-type case that a probability measure-valued function C : R — > "P([0,1]), is an 
entrance law at — oo starting from type 1 if (Ct)te(-oo,oo) * s such that C t solves the McKean-Vlasov 
equation W.32\) and Ct —> 5% as t —± — oo. □ 

We will indeed establish that the emergence of rare mutants gives rise to "random " solutions of the 
McKean-Vlasov dynamics. In particular we will show that the limiting empirical measures at times 
of the form ClogiV + t are random probability measures on [0, 1] and therefore given by sequences 
of [0,1]- valued truly exchangeable random variables which are not i.i.d., that is, the exchangeable 
er-algebra is not trivial. This means that the limiting empirical mean turns out to be a random 
variable and this is the driving term due to migration for the local evolution of a site in the McKean- 
Vlasov limit. Therefore both the random driving term and the non-linearity of the evolution equation 
come seriously into play. However once we condition on the exchangeable cr-algebra, we then get 
for the further evolution again a deterministic limiting equation for the empirical measures, namely 
the McKean-Vlasov equation. The reason for this is the fact that conditioned on the exchangeable cr- 
algebra we obtain an asymptotically (as N —> oo) i.i.d. configuration to which the classical convergence 
theorem applies. Using the Feller property of the system, we get our claim. This leads to the task of 
identifying an entrance law in terms of a random initial condition at time — oo. 

The above discussion shows that we need to introduce the notion of a truly random McKean-Vlasov 
entrance law from — oo. 
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Definition 1.9 (Random entrance laws of McKean-Vlasov from t = — ooj 

We say that the probability measure-valued process {C(t)}teR is a random solution of the McKean- 
Vlasov equation iO.Sty) if 

• {Ct '■ t £ R} is a.s. a solution to W.32\) . that is, for every to the distribution of {Ct ■ t > to} 
conditioned on Tt Q = o{C s : s < to} is given by <5{ Mt } t>t where [it is a solution of the McKean- 
Vlasov equation with [i to = Ct , 

• the time t marginal distributions of {Ct ■ t £ R} are truly random. □ 

We can say the following about the possible random entrance laws. 

Proposition 1.10 (Random entrance laws) 

(a) There exists a solution (£j*(2))tgg to equation \0.32\) satisfying the conditions: 



(0.50) lim C* t *{2) = <5, 



o- 



lim £**(2) = 6i 

t— >oo 

/ xC* *(2,dx) = \. 

J [0,1] z 

This solution is called an entrance law from — oo with mean i at t = 0. 

(b) We can obtain a solution in (a) such that: 

(0.51) 3a £ (0,s) and A £ (0, oo) such that lim e Q|t| / xC* t *{2,dx) = A . 

*^-°° J[o,i] 

(c) The solution of W. 32\) also satisfying W.51\) for prescribed Ao is unique and if Aq £ (0, oo) then a 
is necessarily uniquely determined. 

For any deterministic solution 

(0.52) {C t , t £ R} 



to IQgj) with 

(0.53) 0< limsupe" 1 * 1 / xC t (2,dx) < oo, 

t-¥-00 ^ [0,1] 

the limit A = lim t _ ) ._ 00 e a '*' JL ,, xCt(2, dx) exists. 

If A > 0, then {Ct,t £ R} is given by a time shift of the then unique {C$*,t £ R} singled out in 
IPI1) . i.e. 

(0.54) Ct = C;* +T , r = a- 1 \og^. 

Ao 

For future reference we define (£j) t6 R to be the unique solution satisfying 

(0.55) lim e Q|t| / xC* t (2,dx) = 1 for some a £ (0, s). 

*^-°° J[o,i] 

(d) Any random solution (£ t ) i£ R to W.32\) such that 

(0.56) lim sup e a ^E[[ xC t (2, dx)} < oo, liminf e a|t| [ / xC t (2,dx)} > a.s., 
t-t-oo J [Q t l] t->— oo yj 0]1 ] 

is a random time shift of (C^*)t^m and of (,Q )t S R. D 
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Example 1 Let L\ be a solution satisfying HU.36}) , W.51\) and for a given value of A let t be a true 
real-valued random variable. Then {C^_ T } t ^R is a truly random solution. This can also be viewed as 
saying that we have a solution with an exponential growth factor A which is truly random. 

The emergence of the advantageous type and the subsequent evolution to fixation in this type is 
characterized as follows. 

Proposition 1.11 (Asymptotic macroscopic emergence- fixation process) 

(a) For each — oo < t < oo the empirical measures converge weakly to a random measure: 

(0.57) C[{E l ^ a (t 7 £)}} ^£[{C t (£)}} = Pi eV(V([0,l})), for£=l,2. 

In addition we have path convergence: 

(0.58) w-Vm £[(^ a (f)) ta ]=i'e?[C((^, M ),P(P({0,l})))]. 

A realization of P is denoted (Ct)tem. respectively its marginal processes (Ct(l))tem., (£t(2))teit- 

(b) The process {Ct)tes. describes the emergence and fixation dynamics, that is, for ( £ I, and 
e>0, 

(0.59) lim Prob[C t {2){{e, 1]) > el = 0, 

t— > — OO 

(0.60) lim Prob[C t {2){[\ - e, ll) < 1 - el = 0, 

t— >-oo 

with 

(0.61) £ t (2)((0,l))>0 , VtER, a.s.. 

(c) The limiting dynamic in {0.58$ is identified as follows: 

The probability measure P in $0.58]) is such that the canonical process is a random solution (recall 
Definition \1.9\) and entrance law from time — oo to the McKean-Vlasov equation \0.32\) . 

(d) The limiting dynamic in W. 58\) satisfies with a as in W.39)) : 

(0.62) £[e Q|f| / xCt(2)(dx)} =»■ C[*W] as t ->■ -oo. 

J [0,1] 
and we explicitly identify the random element generating P in \0.58\) . namely P arises from random 
shift of a deterministic path: 

(0.63) P = C[t*£C*} , *£ = (log* W)/a, r r is the time-shift of path by r, 

where C* is the unique and deterministic entrance law of the McKean- Vlasov equation \0.32}) with 
projection £*(2) on the type 2 coordinate satisfying: 

(0.64) e Q|t| / xC* t {2){dx) — >• 1, as t ^ -oo. 

J [0A] 
The random variable *W satisfies 

(0.65) < *W < oo a.s., E[*W] < oo, < Var(*W) < oo. 

(e) We have for sn — > oo with Sjv — "(log N) the approximation property for the growth behaviour 
of the limit dynamic by the finite N model, namely: 

(0.66) C[e aSN x^{^^- - s N )} =*> C[*W}. 
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How does this emergence in (|0.66[) relate to the droplet growth? We have 

Proposition 1.12 (Microscopic emergence and evolution: droplet formation) 
The total type-2 mass \ ' m ([0, 1]) grows at exponential rate a, 

(0.67) C fc m ([0, l])e- atN ] => C[W*l for t N t oo with t N - (a" 1 log iV) — > -oo, 

(0.68) £[*W] = C[W*]. □ 

2 A logistic branching random walk and its growth 

We define here a logistic branching population, take the mean-field limit N — > oo, study the expansion 
of the droplet of occupied sites, determine the late time window and finally the behaviour in a late time 
window (i.e. shift of observation time interval and size of space tend to infinity) in four subsections. 

2.1 The logistic branching particle model 

We now consider a particle system on the geographic space {1, • • • , N} and its occupation number 
configuration with state space 

(0.69) (N ) {1 '- ,JV} . 
Here particles 

• migrate according to a continuous time (rate c) random walk with uniform step distribution, 

• a particular particle at a site i dies at rate d(k — 1) if we have k particles at site i, 

• a particle has one offspring at rate s which is placed at the same site. 

This defines uniquely a strong Markov pure jump process on our state space, which we denote by 

(0-70) fo*)t>o- 

Remark 5 Note that the mean production rate (mean growth rate) of this model in state k is 

(0.71) sk-d(k(k-l)), 

which is a concave function f with /(0) = 0, /(l) = si, f(k) > for k < ko and f(k) < for k > ko- 
This production rate can be interpreted as reflecting limited local resources for a population. 

Remark 6 We can view this process as a supercritical branching random walk (supercriticality pa- 
rameter s) with an additional linear death rate d(k — 1) per individual thus inducing an interaction 
between families. The process is also called a coalescing branching random walk. 

Due to the quadratic death rate the population can only expand indefinitely by having individuals 
move to sites where so far no particles are sitting. When the space is filled the expansion of the 
population is replaced by an equilibrium situation. How to make this precise? 

We want to study this particle system for N — > oo in finite time windows, one early starting at 
time and the other time windows beginning at a late time when space fills up with particles. Here 
we start with one particle and determine the late time window by asking when does the population 
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develop a positive spatial intensity even as N — > oo. Then in particular we focus on the influence of 
the dcathratc d and the migration rate c on the speed of spatial spread. The goal then is to establish 
that the population grows exponentially fast with a rate a which is positive but strictly less than the 
birth rate s as long as we have no collisions. 

Remark 7 Note that if d = 0, then in fact we have a supercritical branching process at exponential 
rate s and it would take time ^ log N to develop positive spatial intensity. 

If we have only one site we have a classical birth and death process, in the spatial model we have 
a collection of such processes. Since the death rate is zero if we have only one particle at a site, and 
furthermore because the death rate is quadratic, we have a positive recurrent Markov process on the 
state space N$ with a unique equilibrium law denoted: 

(0.72) ir N = n» s>d . Write 7r c , s , d for tt* .,„. 

Out of the finite initial state a new site can be occupied and the population can grow till the jumps 
can only hit already occupied sites and this way a population intensity on the whole space can develop 
and a local equilibrium forms, resulting in a global equilibrium density. 

2.2 The early time window as N — > oo 

As N —> oo the geographic space expands to N and the migrating particles eventually do not hit 
occupied sites in a fixed time interval (if we start with finitely many initial particles, the collision-free 
regime, later when collisions occur sites interact again). More precisely, we want to establish that, as 
N — > oo, we get as limit dynamic a collection of birth and death processes with emigration at rate c 
and immigration at rate c from a reservoir with the current intensity in the total population. This is 
carried out as follows. 

Consider a branching random walk with birth and death as before but now with geographic space 
N, so that the state space becomes 

(0.73) (No)" 

and migration changes to 

• emigration out of any component at rate c to the unoccupied site of lowest index as long as 
there is more than one particle 

• immigration at rate c • i into every colony, with i G [0, oo). 
Here if we start the system in an exchangeable initial state we choose 



(0.74) l= Jim (N- l J2v(i))- 



Later in larger time scales we will obtain a i which is time-dependent and arises from the law at a 
tagged site. For i = we obtain what we call the collision-free process. 

The strong Markov process defined by the above McKean-Vlasov dynamic is denoted 

(0.75) (77*)*>o ( rcsp. (r?t)t>o if t = 0). 

This process has for every value i G [0, oo) a unique equilibrium 



(0.76) IT 



i n 



0) 
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where tt l c d is the single site equilibrium. Let r be any map which permutes the location such that 
1, 2, • • • , lOI^^) > 0}| are all occupied and acts on paths by achieving the constraint at the final 
time. We prove 

Proposition 2.1 (Convergence to t-process, collision-free process) 
If we start in an i.i.d. distribution with E\rff (i)\ = i, then 

(0.77) £[(^)t>o] => C[(r^) t>0 }. 

If we start with one initial particle at site 1 and if tw = o(logiV), then 

(0.78) C[(r T,?) t>0 ]=>£[(r,t)t>o]. □ 

JV— Kxi 

2.3 The droplet expansion and Crump-Mode-Jagers processes 

As time gets large we expect the populations of particles to occupy more and more sites (droplet), 
such that the overall population expands as time grows, i.e. we have a growing droplet of occupied 
sites. This droplet growth is expected to be exponential and as time grows to become (at least the 
further evolution) more and more deterministic due to a law of large numbers effect. Here we have to 
distinguish a time window, which is late but where particles essentially always move to new unoccupied 
sites if they migrate and a later phase where spatial intensity builds up and collisions play a role. This 
subsection handles the first regime. 

If we establish an exponential growth behaviour we would be able to determine the position of the 
late time window where the population density becomes positive as time and N tend to infinity. How 
can we make all this rigorous? 

To analyse this time window an important concept is that of a CMJ-process (Crump-Mode-Jagers 
process) which allows us to describe the number of occupied sites in the process (j] t )t>o starting from 
finitely many particles occupying all sites 1, • ■ • , k with k > 1. Let 

(0.79) K t = #{i G N|»h(*) > 0}, 

(0.80) Ct = {Ct(»). t 6 1, ••-,#*} , (t(i)=Vt(i), i=l,---,K t . 

We will find that K, ( are processes which have the structure of a CMJ-process that is the process of 
occupied sites is a type of generalised branching process. We begin by recalling this concept. 

1. Crump- Mode- J agers process 

The CMJ-process models individuals in a branching population whose dynamics is as follows. 
Individuals can die or give birth to new individuals based on the following ingredients: 

• individuals have a lifetime (possibly infinite), 

• for each individual an independent realization of a point process £(£) starting at the birth time 
specifying the times at which the individual gives birth to new individuals, 

• different individuals act independently, 

• the process of birth times is not concentrated on a lattice. 
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The process might be growing exponentially and then we want to determine its exponential growth 
rate. The corresponding Malthusian parameter, a > is obtained as the unique solution of 



(0.81) / e~ at n(dt) = 1 where //([0,i]) = E[£([0,t])}, 



10 

with 

(0.82) £(£) counting the number of births of a single individual up to time t, 

(see for example |J92j . [N] equation (1.4)). 

If we know the Malthusian parameter, we need to know that it is actually equal to the almost 
sure growth rate of the population. It is known for a CMJ-process (K t ) t >o that (Proposition 1.1 and 
Theorem 5.4 in [N]) the following basic growth theorem holds. If 

oo 

(0.83) E[X log(X V 1)] < oo, where X = j e~ at d^(t), 

o 
then 

(0.84) lim — t = W, a.s. and in Li, 

where W is a random variable which has two important properties, namely 

(0.85) W > 0, a.s. and E[W] < oo. 

The empirical distribution of individuals of a certain age in the population converges as t — > oo to 
a stable age distribution 

(0.86) U{oo,du) on [0,oo), 

according to Corollary 6.4 in [N], if condition 6.1 therein holds. The condition 6.1 in JN] or (3.1) in 
jJNj requires that (with fj, as in (|0.81|l ): 



/>oo 

(0.87) / e~ f3t fi(dt) < oo for some f3 < a. 

Jo 



'0 

2. Application of the CMJ-theory to (K t ,Ct) 

The (Ct)t>o we have introduced is a CMJ where individuals are the occupied sites and satisfying 
all the conditions posed above. In addition we have more structure, namely the birth process is 
determined from an internal state of the individual (here a site) which follows a Markovian evolution 
(here our (Ct(*))t>o)- This allows to obtain some stronger statements as follows. 

Define 

(0.88) (Co(M))t>o, as the process (Ct(*))t>o) if i gets occupied at time 0. 
In our case we then have 

(0.89) K[0,t]) = c f E[Co(s)l (Co(s) > 2) }ds. 
Jo 

Define the random measure 
(0.90) U(t, du,j) = #{ sites of size j with birth time t — du}if t _1 . 
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It follows that the random measure U(t, •, •) converges to a deterministic object, the stable age and 
size distribution, i.e. 

(0.91) U(t,-,-) =>U(oo,-,-), as t^oo inlaw, U{oo, -,N) = U(oo, •) from fl0H6). 

We can obtain from this the following representation of the Malthusian parameter a. 

Given site i let tj > denote the time at which a migrant (or initial particle) first occupies it. 
Noting that we can verify Condition 5.1 in [N] we have that 

1 f 

(0.92) lim — y^Cr-(<-Ti) = / £[Co(«)Moo,du) =B (a constant), a.s., 

1=1 o 

by Corollary 5.5 of [N]. The constant B in (|0.92[) is in our case given by the average number of particles 
per occupied site and the growth rate a arises from this quantity neglecting single occupation. Namely 
define 

OO 

(0.93) a = c^jW(oo,[0,oo),j) < oo, 7 = c U(oo, [0, 00), 1). 

Then the mean occupation number of an occupied site is given by 

(0.94) B = ^±2 > 0. 
c 

Furthermore the average birth rate of new sites (by arrival of a migrant at an unoccupied site) at time 
t (in the process in the McKean-Vlasov dual) is equal to 

(0.95) a = c B - 7. 

Here a remarkable point is the relation between a and s. Recall s is the parameter of supercriticality 
in the branching part of the internal state dynamics. The action of the quadratic death part and the 
role of migration lead to a number a with 

(0.96) < a < s, 

since the fraction of the supercritical branching process with supercriticality parameter s which die 
by the quadratic death rate is positive. 

2.4 Time point of emergence as N — > 00 

If we take the collision-free model (rj t )t>o an d WG observe at time Tjv(t) = a^ 1 log N + t the number 
of individuals or the number of occupied sites, both normalized by N, then these quantities satisfy 
(recall (HTMl) . (KTM)) ) 

(°- 97 ) \m E »&Mt)» ^B^+ a " 110 ^) ( x Tjv(t) /7V) tGK ^( e ^+^ llo ^) )teR; 

\ ^ J 

with < W < 00 a.s. and Var{W) > 0. Hence a positive intensity develops: 



(0.98) ]T VT N (t)(i)\ -^B, Vte 



K TN (t) 
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In particular in this time window rj N and r\ differ significantly, but one can show that still: 



(0.99) C[({r o ??" (1), • • • , (r o 7# w+t (fc)}) t >„] => £[(m(l), • • ■ , m(*))t>o], as JV 



OO, 



provided Tjv — of" 1 log .AT — ► — oo as iV — > oo. In fact even though the CMJ-approximation breaks 
down at times a log N + t we can still prove emergence occurs at time a -1 log N, since t?^ develops a 
positive intensity meaning the total population is comparable to N, which as t — > — oo is asymptotically 
equivalent to the r.h.s. of (|0.97[) . Indeed time a -1 log TV separates the collision-free droplet growth 
from the emergence. 

Proposition 2.2 (Emergence of positive intensity) 
We have for N — > oo and v = f((£jv)jveN)' 

i «— v at I ^o , tN — ci 1 logiV —► — oo 

(0.100) C[N-^<ii)]^t W,K(0,oo)) = l, lim(^-a- 1 log7V)=t>-oo. D 

i=l L iV->oo 

2.5 The late time window as TV — > oo 

To treat the time window a -1 log N + 1 and to obtain the limiting emergence-equilibration dynamics 
based on random entrance laws of the McKean-Vlasov equation, we need some ingredients. 

We consider the number of sites occupied at time t denoted K^ and the corresponding measure- 
valued process on [0, oo) x No giving the unnormalizcd number of sites of a certain age u in [a, b) and 
occupation size j: 

(0.101) * N (t,[a,b),j))= [ 1 {k n >k n ^N^^dK", 

J(t-b) 

where Cu (t) denotes the occupation number at time t of the site born at time u, that is, a site first 
occupied the last time at time u, which is therefore at time t exactly of age t — u. 

The normalized empirical age and size distribution among the occupied sites is defined as: 

(0.102) U N (t,[a,b),j) = ^ w ^ N (t,[a,b),j), t > 0, j e {1,2,3, .. .}. 

Denote now for convenience the number of sites: 

(0.103) u N {t) ■- Kf. 

We have obtained with (|0.102j) . (|0.103|) a pair which is Nx'P([0, oo) xN)-valued and which describes 
our particle system completely provided if we consider individuals and sites as exchangeable labels. 
This pair is denoted 

(0.104) (u N (t),U N (t,;-)) t >o- 

In the interval a -1 [log N — log log N, log N+T] the process (u N (t)) t >o increases by one respectively 
decreases by one at rates 

1l N (t\ (u N (t\\ 2 

(0.105) a N (t)(l - -^l)u N (t), respectively lN (t) [ ^» , 
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where ckjv (t) , 7jv (t) arc defined: 

t t 

/oo ,. 

^jU N (t,dsJ)ds, j N (t) =c U N (t,ds,l). 

J=2 

The above rates of change of U N (t, ■, ■) follow directly from the dynamics of the particle system rj N . 
Our goal is now to study the behaviour of (u N , U N ) at times a^ 1 logiV + t and to show that this 
follows a limiting fixation dynamics. We start the system with k particles at £ distinct sites and write 
k, I as superscript. We need the time-shifted quantities: 

(0.107) u N ^\t) = «".M((k|^ + 1 ) v 0), t € (-oo, T], «^.W(_^I^) = £ 

a a 

(0.108) £f^M(i) = t/W (( i2I^ +i )v0), te(-oc,T], ^.M ( _l2l^) = S(k0) . 

a a 

Proposition 2.3 (Convergence to a colonization- equilibration dynamic in the N — > oo limit) 

Assume that for some to £ K as N —> oo, (-hu ' ' (to),U N ' ' (to)) converges in law to the pair 

(u(t(j),U(to)) automatically contained in [0,oo) x Li(N, v). 
Then as N — > oo 



(0.109) C 



(^• W (*),^' w (*,v))) t 



£[(^(t),f/^(t, v ))) t > i0 ] 



m Zaw on pathspace, where the r.h.s. is supported on the solution of the nonlinear system iO.lll]) 
and W.112}) corresponding to the initial state (u(t ),U(to)). (Note that the mechanism of the limit 
dynamics does not depend on k or t, but the state at t will.) □ 

We obtain the limiting system (u, U) as follows. We specify the pair 
(0.110) (u, U) = (u(f), U{t)) tm , with (u(t), U{t)) 6l+x Xi(R+ x N), 
by the (coupled) system of nonlinear forward evolution equations: 

(0.111) ^ = a(t)(l - u(t))u(t) - 7 (t)w 2 (t), 



(0.112) 



dU(t,dv,j) 
dt 



dU(t,dv,j) 
dv 
+s(j - l)l m U{t,dv,j - 1) - s 3 U(t,dv,j) 

+~{j + l)jU{t, dv,j + 1) - -j(j - l))l j¥ iC/(t, dv,j) 

+c(j + l)U{t,dv,j + 1) - cjll{t,dv,j)l m 

-cu(t)U(t,dv,l)l j=1 

+u(t){a(t) + 1 (t))[l& 1 U(t,dv,j - 1) - C/(t,dw,i]) 

+ ((l-«(t))a(t))l J -=i-*o(d«) 

- (a(t)(l - u(t)) - 7(*)»(*)) ' ^ (*, *>» J")- 
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Wc have to constrain the state space to guarantee the r.h.s. above is well-defined. Set therefore v 
to be the measure on N given by 

(0.113) v(j) = l+j 2 

and consider R £g> L x (v, N) as a basic space for the analysis. Then the equations (|0.111[) . (|0.112|) have 
the following properties. 

Proposition 2.4 (Uniqueness of the pair (u, U)) 

(a) The pair of equations \0.111\) - W.112}) , given an initial state from R + x .A/fi(R + x N) satisfying 

(0.114) (u(O), t/(0, •, •)) e R ® L l {n, v) 

at time to, has a unique solution (u(t), U(t)) t >t with values in R + ® Lq_(N, v), which satisfies u > 
andU(t,R+ x N) = 1. 

(b) There exists a solution (u, U) with time parameter t G R for every A £ (0, oo) with values in 
M.(g> L 1 (N,iy), such that 

(0.115) u(t)e- at ^ A as t ^ -oo, 
(0.116) U{t) — > U{oo). 

Here U(oo) is the stable age and size distribution of the CMJ-process corresponding to the particle 
process {K t ,C t t)t>Q given by the McKean-Vlasov dual process r\, as defined in W.80\) . 

(c) Given any solution (u, U) of equations W.lll}) - \O.lV0fj for t £ I with values in the space 
R(g)i 1 (N,z/) satisfying 

(0.117) u(t)>0, limsupe _Q *u(t) < oo, 

t— > — oo 

we then have for u 

(0.118) A= lim e~ at u{t) 

t— > — oo 

exists and the solution satisfying this for given A is unique. Furthermore U satisfies 

(0.119) U(t) =>■ W(oo) os t ->■ -oo. □ 

Remark 8 Potential limits arising from (u N ,U N ) do satisfy equation {O.lTfy . 

Remark 9 Note that looking at the form of the equation we see that a solution indexed by R remains 
a solution if we make a time shift. This corresponds to the different possible values for the growth 
constant A in liO.115]) . In particular the entrance law from at time — oo is unique up to the time 
shift. 

We can now identify the behaviour of (u N , U N ) for N — > oo in terms of the early growth behaviour 
of the droplet of colonized sites as follows. 

Proposition 2.5 (Identification of colonization- equilibrium dynamics) 

The limits {u k ' (t), U k,i (t, •, •)))t>to> ™ WH3M can oe represented as the unique solution of the 
nonlinear system iO.lll)) and W.112\) satisfying 

(0.120) lim e- at u k -\t) = W k > e , lim U k ' 1 it) = U(oo), 

t— > — oo t— > — oo 

with W k ' e having the law of the random variable appearing as the scaling (by e~ at ) limit t — > oo of 
the CMJ-process K t ' started with k particles at each of I sites. □ 
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We can now ask, what happens if we consider times a 1 log N + ijv with t^ —* oo. Then we reach 
a global stable state. Let II£, d g II£ d conditioned to be strictly positive. 

Proposition 2.6 (Equilibrium population) 
We have 



c.d.s' 



(0.121) N- 1 u N (a- 1 \ogN + t N ) — y C , U N (a~ 1 logN + t N ) => U l * d 

N->-oo N — 

where L* satisfies the self consistency relation: 

oo 

(0.122) £>$,.({*}) = t. □ 



fc=i 



3 The duality relation 

In this section we relate the processes from section [T] and [5] with each other by duality and we discuss 
the extension of the duality to more than two types. 

3.1 A classical duality formula 

The key tool in relating the two processes we have introduced is duality. Recall the classical relation 
between a single Fisher- Wright diffusion and the Kingman coalescent. 
Consider (X t )t>o solving 



(0.123) dX t = y/d-X t (l-X t )dWt 

and (Dt)t>o being the N-valucd death process 

(0.124) n ->■ n - 1 at rate d ■ ( " 

Then a generator calculation shows (we explain more on the background below): 

(0.125) E Xo [(X t ) k ]=E k [X^]. 

The analogous relation can be formulated for our mean-field spatial model including mutation and 
selection. Define for the process starting with fc-particles at each of £ sites: 

oo 

(0.126) n?' k ' e = u N (t)Y,3U N (t,M. + ,3), 

J=l 

where (u N (t), U N (t, M + , N)) are given by (J0.104JI . Then we get by a generator calculation the formula: 



/ 

m r 
o 



(0.127) E[xUht)] = E[exp{--jTl^d U )], i € {!,-■ -,N}. 



Hence if we start initially with x^ (i, 0) = 1; i = 1, • • • , N we see that in order to observe a mean 
which is less than one we need an occupation measure of the population from the branching coalescing 
random walk which is of order N. Due to the exponential growth this amounts to II*' 1,1 to grow up to 
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order N. Since we proved the latter behaves at times a 1 log N + t for very negative t approximately 
like 

(0.128) Wexp{a{a- 1 log N + t)) = e at ■ WN, 

the advantageous type emerges since E[x^ (1, oT 1 log A*" + t)] is as N — > oo for t € M. strictly between 
and 1. By the analysis of moments we actually can prove the results on the Fisher- Wright diffusion 
model stated in section [1] from the results of the coalescing particles model stated in section [2j In 
section 7 of |DGsel| methods are developed to turn this idea into rigorous mathematics. 

3.2 The genealogy and duality 

The formula (|0.125|) can be understood on a deeper level, since the dual process can be interpreted in 
terms of the genealogy of the Fisher- Wright diffusion model. This will also exhibit the role of selection 
a bit better. 

For that purpose the Kingman coalescent has to be viewed as a partition-valued process. This 
means its states are the partitions of the set {1, ••-,&} starting in {{1}, • • -,{k}} where partition 
elements coalesce at rate d independent of each other (and in the spatial case perform continuous time 
random walks at rate c) . If we consider a Fisher- Wright diffusion it can be viewed as the limit of the 
Moran model taking the population size to infinity. Pick then from the population at time t exactly 
k individuals and look at their genealogy. This genealogy has the same law as the genealogy of the 
Kingman coalescent with the genealogical distance between two individuals defined to be the first time 
they are in the same partition element. Given this genealogy we can calculate the probability, that 
all individuals have the same type, one, which is x k if x is the current (time t) frequency of type one, 
in terms of the Kingman coalescent getting the duality relation (|0.125f> . See [GPWmetricj , |GPWmp| 
for more details on the genealogical processes. 

If we include selection note that if we follow the tagged sample from the population back one of the 
individuals might interact with the rest of the population (note now individuals under selection and 
mutation arc not anymore exchangeable) by a selection event. What results from this event however 
now depends on the current types of the two involved individuals. Therefore we have to also follow 
the other individual further back. This means we expect that each time a selection event occurs our 
tagged sample has to be enriched by a further particle. This is reflected in the birth of new individuals 
in the dual particle model. 

In order to handle multiple types and mutation the point now is that the dual process has to 
be complemented by a function- valued part. The basic idea behind this is explained in the next 
subsection. 

3.3 The dual for general type space 

The picture from above for two types is more subtle, if we consider a multitype situation with more 
than 2 types. The basic new ideas for this purpose in |DGsel| we sketch next. 

3.3.1 A multitype model 

The process considered in section Q] can be defined for general type space I (a subset of [0,1]) as 
(V (I)) -valued process (S = a finite or countable geographic space) and is called the interacting 
Fleming- Viot process with selection and mutation, see |DG99j . (which becomes a multitype Fisher- 
Wright diffusion in the case of finitely many types and the model of section [T] for two types). Then 
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types have a fitness given by a function and mutation occurs via a jump kernel which are denoted 
respectively by 



(0.129) X ■ I 



< \ < 1, M(x, dy) a I x I-probability transition kernel. 



The generator of the nonspatial Fleming- Viot process acts on monomials of order n with test function 
/ 6 C-fe(I) 



(0.130) F(x) = / /(«!, • • ■ , «„)a:(d«i) ■ • • s(d«„), x £ P(I), 
as follows, with setting Q x (du, dv) = x(du)5 u (dv) — x(u)x(dv): 

(0.131) (GF)(x) = s y'|^M (u) L (u )_^ xHa;(du; ^| a;(du) 



ajF(:E) («)M( U ,d V )-^^(u) 



ftr 



dx 



d 2 F(x) 
dxdx 



(u 1 v)Q x (du,dv) 



i i 



x eV( 



In the spatial case a corresponding drift term from migration appears as well: 
(0.132) 



i,jes 



r -s / ( dF(x) dF{x) , 
a ^ ,J ' I ^~dx~^ U ' Q^—m)Xi(du). 



In |DK| a dual for a Fleming- Viot process with mutation and selection is introduced in order to 
show that the process is well-defined by its martingale problem. In |DGsel| a new dual was developed 
which makes possible the study of the long-time behavior and the genealogy of the system. This we 
explain now. In order to introduce the main ideas we first consider the case with no mutation and 
the special case TV = 1 and then the general case. 

3.3.2 The dual with selection 

Let now the mutation rate be zero and let \S\ = 1, so that the state space is T-'(I) and we only have 
resampling and selection. Consider the class of functions 



(0.133) F((n, /),*):= 



f(ui, . . . , u n )x{du 1 ) . . . x(du n ) 



for all n £ N, / £ Loo((I) n ,M) and x £ V(T). Note that given a random probability measure X on 
■p(I), the collection 

(0.134) {E[F((n, /), X)],ne N, / £ ^((I)", R)} 

uniquely characterizes the probability law of X. 
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The class given above contains the functions which we will use for our dual representations. In 
fact it suffices to take smaller, more convenient sets of test functions /: 

n 

(0.135) f{u x ,...,u n ) = JJ /<(«*)> /fGiooGO- 

If we consider the case when I is finite, it would even suffice to take functions 
(0.136) /,-(«) = l,-(«) or /,-(«) = l Aj (u), 
where lj is the indicator function of j G I. 

The function-valued dual processes (f]t,Tt)t>o and (r] t , T~[ )t>o are constructed from the following 
four ingredients: 

• N t the number of individuals present in the dual process which is a non-decreasing N-valucd 
process with N = n, the number of initially tagged individuals, and N t > n, 

• Ct = {lj • • ' j Nt} is an ordered particle system, 

• TT t : is a partition (jrl, • • ■ , 7rj ) of Ct, i-e. an ordered family of subsets, where the index of a 
partition element is the smallest element of the partition element, 

• JFt is for given states of Tr t ,(t,N t a function in L 00 (l' 7r *') which is obtained from a function 
in L M (I ') by setting variables equal which are corresponding to one and the same partition 
element and Tt changes further driven selection (see below). 

Definition 3.1 (Evolution of (t],T) and (n,T + )) 

(a) The process n is driven by coalescence at rate d of every pair of partition elements and by the 
birth of a new individual at rate s, which forms its own partition element. 

(b) Conditioned on the process n the evolution of J- is as follows. 

• The coalescence mechanism: If a coalescence of two partition elements occurs, then the corre- 
sponding variables of Ft are set equal to the variable indexing the partition element, i.e. for 
Tt- = g we have the transition (here Uj denotes an omitted variable) 

(0.137) g{ux,---,Ui,---,Uj,---,u m ) — > g(ui,--- ,%, ■■■,u m ) 

= g(ui,- ■-,«»,•■ -,Uj-i,Ui,Uj+i, ■ ■ ■ ,u m ), 

so that the function changes from an element of L oa (I m ) to one o/Loo(I m_1 ). 

• The selection mechanisms: 

— Feynman-Kac. For Tt, if a birth occurs in the process (£ s ) due to the partition element 
to which the element i of the basic set belongs, then for Tt- = g the following transition 
occurs from an element in L oc ((T) m ) to elements in L< XJ ((T) rn+1 ): 

(0.138) g(m, ..., u m ) — ► x(ui)g(ui, ..., u m ) - x(«m+i)ff(«i, ■ ■ ■ , u m ). 
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— Non-negative. For J-~ t + the transition \0.138\) is replaced by (provided that x satisfies < 

(0.139) g{u 1 ,---,u m ) — >g(u 1 ,...,u m+1 ) = (xK) + (l-x("m+i)))ff("i, • • • ,u m ). □ 

Now we can obtain two different duality relations, the first below working in all cases and a second 
below working in a large subclass of models. 

Proposition 3.2 (Duality relation - signed with Feynman-Kac dual) 

Let (X t ) t >o be a solution of the Fisher-Wright martingale with finite type space I, fitness function \ 
and selection rate s with X$ = x G 7-* (I). Let To = f G L 00 ((Tj n ) for some n G N. 
Assume that to is such that: 

(0.140) EMexpfs/ \ir r \dr J J < oo. 

Then for < t < to, (r)t,Tt) is the Feynman-Kac dual of (X t ), that is: 

(0.141) E[F(( Vo J),X t )]=E {vo ^ o) l exp('fWr\dr) 



J"t(ui, ■ ■ .,u\„ t \)x(dui) . . . x(du\ Vt \) 



L : 



where the initial state (^0,^0) is for n G N chosen given by 

(0.142) n Q = [({l},{2},...,{n})], 

Jo = feC b (I n ). D 

Remark 10 In [DG9V^ it was shown that there exists to > for which pTTJW is satisfied. 



The disadvantage of the dual above is the exponential term together with the signed function. This 
involves the interplay of a cancelation effect and the exponential growth factor which is often hard to 
analyse as t -H> 00. The key observation is that if the fitness function \ is a bounded function then we 
can obtain the following duality relation that does not involve a Feynman-Kac factor and preserves 
the positivity of functions: 

Proposition 3.3 (Duality relation - non-negative) 

With the notation and assumptions as in Proposition lS. 2\ (except with W.138]) replaced by 110.139]) ) 
we get for x with 

(0.143) < x < 1 

that for all t G [0, 00) we have: 

(0.144) E[F(( m J),X t )]=E (vo ^ ) 



J" t + (ui, • • • ,Ufr t \)xi(dui) ■ ■ ■ XQn t \)(du\„ t \) 



Moreover, T^ is always non-negative if T§ is. 



U 
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3.3.3 The dual with migration, selection and mutation 

We now consider the case of (|0.3[) . (|0.4[) including selection and migration but for the moment setting 
m = 0. The partition elements of the dual now have locations given by 

(0.145) &:7r t — ►fi£* 1 . 

The corresponding additional dual migration dynamics is as follows. At rate c each particle can 
jump from its current location to a randomly chosen point in {1, ... , A}. In the limiting case (N — > oc) 
the particle always migrates to an empty site which for convenience we can take to be the smallest 
unoccupied site neN. This then results in precisely the logistic branching particle model respectively 
the Crump- Mode- Jagers process described in section [2j 

The duality relation is now given by 



(0.146) E[F((r, J),X t )]=E i 



(fo.-^o 



-1 



J^{u u ■ ■ ■ ,u^ tl )x Ml) (du x ) ■ ■ ■ x Ml „ tl) (du M ) 



where X n = (xi, . . . ,x n ), Xi G 7>(I), i = 1,...,N, 770 = (Co,tto,£o). 

The mutation can now be incorporated by adding a corresponding transition of J 7 , at rate m for 
every variable the operation acting on g 6 (L°°(T)) n by 

(0.147) g(ui,u 2 ,--- ( «n) — > / g(ui,- • • ,v,Ui+i, . . . ,u n )M(ui t dv). 

Then the relation (|0.144[) holds with mutation. 

Example of application Now consider the case with m > 0, I — {1,2} as in section 1 and 
J-q = 1\. This effect of rare mutation from type 1 to type 2 results in the dual in the transition 

(0.148) li ->■ at rate — . 

Returning to our duality relation we now see that the particles in our process 77 stand for possible 
individuals, represented by the factor li which could by mutation represent a possible line through 
which the advantageous type can enter the population. More precisely, if To = li, then we can check 
that as time increases we have an increasing number, lT^' 1,1 of such factors and any of these can 
undergo the rare mutation transition. 
Therefore 

t 



(0.149) V t N = ^Jll^du 





represents the hazard function for a rare mutation to occur and therefore (1 — exp(— V t )) the mean 
of x^ (t). Hence the growth of the spatial intensity in the logistic branching particle model relates to 
the emergence of the rare mutant population. 

As a simple application of this dual we can now verify (|0.13|) . (|0.14[) when there is only one site. 
In the case d = we have a linear birth process with birth rate s and therefore V t N grows like e st and 
therefore the first rare mutation occurs after a time of order 0{ ° s ). On the other hand when d > 
the process does not grow indefinitely but approaches an equilibrium. In this case V t N grows only in 
a linear fashion and therefore requires time of order O(L). 

Wc note that since in the process, respectively the dual, time has to be read forward, respectively 
backward, from t, the rare mutation jumps occuring in times a -1 log A + t for the dual after the 
expansion of the population up to time a -1 log N, correspond to rare mutations in the original process 
occurring at times between and t and then growing till the takeover after time oT x log A. 
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3.4 Outlook on set-valued duals 

If we consider |I| < oo and use for J-g products of indicator functions we obtain under the evolution 
sums of products of indicator functions where the dynamic of the different summands is coupled by 
the transition occuring in the underlying process r\. Here we briefly describe the main idea how to 
describe the dual based on a set-valued process but where we introduce the order of individuals and 
we use a change in the coupling between summands. 

To explain the main idea we again consider the case I = {1, 2}, N = 1. Now consider the case 

(0.150) Jo = 1 3 . 

Then at the random time r at which one selection operation occurs we have 

(0.151) .F T = 1 <g> 1 2 + li <g> 1 2 . 

We can now regard this as defining a subset of I 2 . If we now couple the transitions in the two 
summands differently, namely we write instead of (|0.151|) (this we can do since the dual expression 
depends only on the marginal law of the summands not the joint law) 

(0.152) 1 2 <g> 1 + li <g> 1 2 , 

then we can ensure that the summands continue to correspond to disjoint subsets of I and therefore 
we obtain a dual process with values in subsets of I . 

A much more complicated set-valued dual can be constructed for the general multitype Fisher- 
Wright diffusion with mutation, selection and migration. The key point is to first take the non-negative 
function-valued dual driven by the particle system we introduced and then to introduce the order of 
factors and an appropriate coupling of its decomposition into a set of summands. This allows us to 
obtain a duality relation for general finite type space and additive selection with a bounded fitness 
function. 

This duality relation can be interpreted in terms of the ancestral lines of a tagged sample of 
individuals picked from the time-i population, since the dual now gives a decomposition in disjoint 
events for the ancestral lines and genealogical tree for a tagged sample of n individuals from the time 
t population from which we can read off current types and genealogical distance of the tagged sample. 

References 

[DGscl] D. Dawson and A. Greven: On the effects of migration in spatial Fleming- Viot models with 
selection and mutation, in preparation 2010. 

[Bu] R. Burger (2001). The Mathematical Theory of Selection, Recombination, and Mutation, 

Wiley. 

[DG99] D. A. Dawson and A. Greven (1999), Hierarchically interacting Fleming-Viot processes with 
selection and mutation: Multiple space time scale analysis and quasi equilibria, Electronic 
Journal of Probability, Vol. 4, paper no. 4, pages 1-81. 

[D] D. A. Dawson (1993), Measure-valued Markov Processes. In: Ecole d'Etc dc Probabilites 

de Saint Flour XXI, Lecture Notes in Mathematics 1541, 1-261, Springer- Verlag. 

[DG2] D. A. Dawson and A. Greven (1993b), Multiple time scale analysis of interacting diffusions. 
Probab. Theory Rel. Fields 95, 467-508. 



REFERENCES 29 



[DG3] D. A. Dawson and A. Greven (1993c), Hierarchical models of interacting diffusions: multiple 
time scale phenomena. Phase transition and pattern of cluster-formation. Probab. Theory 
Rcl. Fields, vol. 96, 435-473. 

[DK] D.A. Dawson and T.G. Kurtz (1982). Applications of duality to measure-valued diffusions, 

Springer Lecture Notes in Control and Inf. Sci. 42, 177-191. 

[EK4] S. N. Ethier and T. G. Kurtz (1994), Convergence to Fleming-Viot processes in the weak 
atomic topology, Stochastic Process, Appl. 54, 1-27. 

[Gar] J. Gartner (1988), On the McKean-Vlasov limit for interacting diffusions. Math. Nachr. 

137, 197-248. 

[GPWmp] A. Greven, P. Pfaffelhuber and A. Winter, Tree-valued resampling dynamics: Martingale 
Problems and applications, submitted to PTRF 2010. 

[GPWmctric] A. Greven, P. Pfaffelhuber and A. Winter (2009), Convergence in distribution of random 
metric measure spaces (The A-coalescent measure tree), PTRF, Vol. 145, issue 1, 285 ff. 

[Hu] M. Hutzenthaler (2009), The virgin island model, Electr. J. Probab. 14, 1117-1161. 

[J92] P. Jagcrs (1992), Stability and instability in population dynamics, J. Appl. Probab. 29, 

770-780. 

[JN] P. Jagers and O. Nerman (1984), The growth and composition of branching populations, 

Adv. in Appl. Probab. 16, 221-259. 

[Mc] H. P. McKcan, Jr. (1966), A class of Markov processes associated with nonlinear parabolic 

equations, Proc. N.A .S., U.S.A. 56:1907-19 

[N] O. Nerman (1981), On the convergence of supercritical general (C-M-J) branching processes, 

Zcitschrift f. Wahrscheinlichkcitsth. vcrw. Gebiete, 57, 365-395. 

[PY] J. Pitman and M. Yor (1982), A decomposition of Bessel bridges, Z. Wahr. verw. Geb. 59, 

425-457. 

[RW] L.C.G. Rogers and D. Williams (1987). Diffusions, Markov processes and martingales, Vol. 

2, Wiley. 

[SchirmlO] F. Schirmeier (2010), A spatial population model in separating time windows, master thesis, 
Department Mathematik, Erlangen. 



